Serveur d'exploration sur la recherche en informatique en Lorraine

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Bruno Scherrer And NotFranche-Comté

List of bibliographic references

Number of relevant bibliographic references: 89.
Ident.Authors (with country if any)Title
000453 Manel Tagorti [France] ; Bruno Scherrer [France]On the Rate of Convergence and Error Bounds for LSTD(λ)
000454 Boris Lesner [France] ; Bruno Scherrer [France]Non-Stationary Approximate Modified Policy Iteration
000A93 Bruno Scherrer [France]Approximate Policy Iteration Schemes: A Comparison
000B92 Manel Tagorti [France] ; Bruno Scherrer [France]Vitesse de convergence et borne d'erreur pour l'algorithme LSTD($\lambda$)
000B93 Bruno Scherrer [France]Une étude comparative de quelques schémas d'approximation de type iterations sur les politiques
000B95 Manel Tagorti [France] ; Bruno Scherrer [France]Rate of Convergence and Error Bounds for LSTD($\lambda$)
000D16 Matthieu Geist [France] ; Bruno Scherrer [France]Off-policy Learning with Eligibility Traces: A Survey
000D49 Eugene A. Feinberg [États-Unis] ; Jefferson Huang [États-Unis] ; Bruno Scherrer [France]Modified policy iteration algorithms are not strongly polynomial for discounted dynamic programming
000F08 Bruno Scherrer [France]Improved and Generalized Upper Bounds on the Complexity of Policy Iteration
000F09 Victor Gabillon [France] ; Mohammad Ghavamzadeh [France] ; Bruno Scherrer [France]Approximate Dynamic Programming Finally Performs Well in the Game of Tetris
000F29 Alain Dutech [France] ; Bruno Scherrer [France] ; Christophe Thiery [France]La carotte et le bâton... et Tetris
001120 Bruno Scherrer [France] ; Boris Lesner [France]Sur l'utilisation de politiques non-stationnaires pour les processus de décision Markoviens à horizon infini
001122 Bruno Scherrer [France]Quelques majorants de la complexité d'itérations sur les politiques
001130 Manel Tagorti [France] ; Bruno Scherrer [France] ; Olivier Buffet [France] ; Joerg Hoffmann [France]Abstraction Pathologies In Markov Decision Processes
001172 Manel Tagorti [France] ; Bruno Scherrer [France] ; Olivier Buffet [France] ; Joerg Hoffmann [France]Abstraction Pathologies In Markov Decision Processes
001183 Bruno Scherrer [France] ; Matthieu Geist [France]Policy Search: Any Local Optimum Enjoys a Global Performance Guarantee
001194 Bruno Scherrer [France]On the Performance Bounds of some Policy Search Dynamic Programming Algorithms
001244 Boris Lesner [France] ; Bruno Scherrer [France]Tight Performance Bounds for Approximate Modified Policy Iteration with Non-Stationary Policies
001334 Bruno Scherrer [France]Performance Bounds for Lambda Policy Iteration and Application to the Game of Tetris
001750 Matthieu Geist [France] ; Bruno Scherrer [France]Off-policy Learning with Eligibility Traces: A Survey
001825 Bruno Scherrer [France] ; Boris Lesner [France]On the Use of Non-Stationary Policies for Stationary Infinite-Horizon Markov Decision Processes
001A68 Bruno Scherrer [France] ; Mohammad Ghavamzadeh [France] ; Victor Gabillon [France] ; Matthieu Geist [France]Approximate Modified Policy Iteration
001B39 Bruno Scherrer [France] ; Victor Gabillon [France] ; Mohammad Ghavamzadeh [France] ; Matthieu Geist [France]Approximate Modified Policy Iteration
001C03 Bruno Scherrer [France]On the Use of Non-Stationary Policies for Infinite-Horizon Discounted Markov Decision Processes
002138 Matthieu Geist [France] ; Bruno Scherrer [France]l1-penalized projected Bellman residual
002139 Bruno Scherrer [France] ; Matthieu Geist [France]Recursive Least-Squares Learning with Eligibility Traces
002267 Victor Gabillon [France] ; Alessandro Lazaric [France] ; Mohammad Ghavamzadeh [France] ; Bruno Scherrer [France]Classification-based Policy Iteration with a Critic
002279 Bruno Scherrer [France] ; Matthieu Geist [France]Moindres carrés récursifs pour l'évaluation off-policy d'une politique avec traces d'éligibilité
002378 Victor Gabillon [France] ; Alessandro Lazaric [France] ; Mohammad Ghavamzadeh [France] ; Bruno Scherrer [France]Classification-based Policy Iteration with a Critic
002841 Bruno Scherrer [France]Performance Bounds for Lambda Policy Iteration and Application to the Game of Tetris
002C27 Bruno Scherrer [France]Should one compute the Temporal Difference fix point or minimize the Bellman Residual? The unified oblique projection view
002C29 Christophe Thiery [France] ; Bruno Scherrer [France]Least-Squares λ Policy Iteration: Bias-Variance Trade-off in Control Problems
002C72 Christophe Thiery [France] ; Bruno Scherrer [France]Least-Squares λ Policy Iteration : optimisme et compromis biais-variance pour le contrôle optimal
003231 Bruno Scherrer [France] ; Christophe Thiery [France]Performance bound for Approximate Optimistic Policy Iteration
003232 Alain Dutech [France] ; Bruno Scherrer [France]Partially Observable Markov Decision Processes
003565 Christophe Thiery [France] ; Bruno Scherrer [France]Une approche modifiée de Lambda-Policy Iteration
003859 Christophe Thiery ; Bruno ScherrerConstruction d’un joueur artificiel pour Tetris
003C68 Christophe Thiery [France] ; Bruno Scherrer [France]Improvements on Learning Tetris with Cross Entropy
003C92 Christophe Thiery [France] ; Bruno Scherrer [France]Building Controllers for Tetris
003D48 Bruno Scherrer [France] ; Shie Mannor [Canada]Error Reducing Sampling in Reinforcement Learning
003D49 Cesar Torres-Huitzil [Mexique] ; Bernard Girau [France] ; Amine Boumaza [France] ; Bruno Scherrer [France]Embedded harmonic control for trajectory planning in large environments
003D50 Marek Petrik [États-Unis] ; Bruno Scherrer [France]Biasing Approximate Dynamic Programming with a Lower Discount Factor
004139 Alain Dutech [France] ; Bruno Scherrer [France] ; Christophe Thiery [France]La carotte et le bâton... et Tetris
004273 Amine Boumaza [France] ; Bruno Scherrer [France]Analyse d’un algorithme d’intelligence en essaim pour le fourragement
004474 Alain Dutech [France] ; Bruno Scherrer [France]Processus décisionnels de Markov partiellement observables
004599 Bernard Girau [France] ; Amine Boumaza [France] ; Bruno Scherrer [France] ; Cesar Torres-Huitzil [Mexique]Block-synchronous harmonic control for scalable trajectory planning
004648 Amine Boumaza [France] ; Bruno Scherrer [France]Convergence and rate of convergence of simple ant models
004725 Amine Boumaza [France] ; Bruno Scherrer [France]Convergence and Rate of Convergence of a Foraging Ant Model
004952 Amine Boumaza [France] ; Bruno Scherrer [France]Convergence and rate of convergence of a simple ant model
004958 Amine Boumaza [France] ; Bruno Scherrer [France]Optimal control subsumes harmonic control
004E85 Amine Boumaza [France] ; Bruno Scherrer [France]Convergence and rate of convergence of a simple ant model
004F65 Bruno Scherrer [France]Une condition suffisante pour l'implémentation connexionniste asynchrone
005694 Amine Boumaza [France] ; Bruno Scherrer [France]Convergence et taux de convergence d'un algorithme fourmi simple
005706 Amine Boumaza [France] ; Bruno Scherrer [France]Optimal control subsumes harmonic control
005989 Amine Boumaza [France] ; Bruno Scherrer [France]Navigation, fonctions harmoniques et contrôle optimal stochastique
005C50 Bruno Scherrer [France]Asynchronous Neurocomputing for optimal control and reinforcement learning with large state spaces
005D52 Bruno ScherrerAsynchronous Neurocomputing for optimal control and reinforcement learning with large state spaces
005D65 Amine Boumaza ; Bruno ScherrerNavigation, fonctions harmoniques et contrôle optimal stochastique
006E36 Bruno Scherrer [France]Approche connexionniste du contrôle optimal
007022 Bruno Scherrer [France] ; Shie Mannor [États-Unis]Error reducing sampling in reinforcement learning
007196 Bruno Scherrer [France]Modular self-organization for a long-living autonomous agent
007272 Bruno Scherrer [France]Parallel asynchronous distributed computations of optimal control in large state space Markov Decision Processes
007292 Bruno Scherrer [France]Apprentissage de représentation et auto-organisation modulaire pour un agent autonome
007530 Bruno ScherrerModular self-organization for a long-living autonomous agent
007531 Bruno ScherrerModular self-organization for a long-living autonomous agent
007608